Synthesising Reinforcement Learning Policies Through Set-Valued Inductive Rule Learning
نویسندگان
چکیده
Today’s advanced Reinforcement Learning algorithms produce black-box policies, that are often difficult to interpret and trust for a person. We introduce policy distilling algorithm, building on the CN2 rule mining distills into rule-based decision system. At core of our approach is fact an RL process does not just learn policy, mapping from states actions, but also produces extra meta-information, such as action values indicating quality alternative actions. This meta-information can indicate whether more than one near-optimal certain state. extend make it able leverage knowledge about equally-good actions distill fewer rules, increasing its interpretability by Then, ensure rules explain valid, non-degenerate we refinement algorithm fine-tunes obtain good performance when executed in environment. demonstrate applicability Mario AI benchmark, complex task requires modern reinforcement learning including neural networks. The explanations capture learned only few allow person understand what agent learned. Source code: https://gitlab.ai.vub.ac.be/yocoppen/svcn2.
منابع مشابه
Rule Combination in Inductive Learning
This paper describes the work on methods for combining rules obtained by machine learning systems. Three methods for obtaining the classification of examples with those rules are compared. The advantages and disadvantages of each method are discussed and the results obtained on three real world domains are commented. The methods compared are: selection of the best rule; PROSPECTOR-like probabil...
متن کاملAdvances in Inductive Rule Learning
A extension to the k-nearest neighbor classifier is described in which automatically induced rules are used as binary features, which are active in an instance when the left-hand side of the corresponding rule matches with the instance. The ripper rule induction algorithm is employed to produce the rules. The similarity between a memory instance and a new instance is based on the rules the two ...
متن کاملImitative Policies for Reinforcement Learning
We discuss a reinforcement learning framework where learners observe experts interacting with the environment. Our approach is to construct from these observations exploratory policies which favor selection of actions the expert has taken. This imitation strategy can be applied at any stage of learning, and requires neither that information regarding reinforcement be conveyed from the expert to...
متن کاملLearning Relational Options for Inductive Transfer in Relational Reinforcement Learning
In reinforcement learning problems, a learning agent has the task of learning a good or optimal strategy from interaction with his environment. At the start of the learning task, the agent usually has very little information. Therefore, when faced with complex problems that have a large state space, learning a good strategy might be infeasible or too slow to work in practice. One way to overcom...
متن کاملInductive Rule Learning on the Knowledge
We present an approach to learning sets of recursive rules based on analytical inductive programming. We propose that our approach can be used within cognitive architectures to model regularity detection and generalization over experience. Induced rule sets represent the knowledge underlying systematic behavior in complex situations. Such rule sets can explain systematicity and productivity of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-73959-1_15